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Description 

This invention relates to the molecular cloning and characterization of a gene sequence coding for 
human relaxin. The invention is also concerned with recombinant DNA techniques for the preparation of 
5 human relaxin, prorelaxin and preprorelaxin. 

In our Australian Patent Application No. 17906/83 (PF 5352/82, filed 12th August, 1982) (EP-A- 
0101309), we described the molecular cloning and characterization of a gene sequence coding for human 
relaxin. We have now found a second gene which also codes for human relaxin. 

More specifically, this invention relates to an isolated and purified (i.e. ."cloned") human gene coding for 
w prorelaxin, preprorelaxin, and the A and/or B and/or C peptide chains of human relaxin, methods for 
isolating and purifying the genes and a method for transferring the genes to and replicating the genes in a 
host cell. The cloned genes are expressed by the host cell when fused with a host-expressable prokaryotic 
or eukaryotic gene. The genes are thus useful in the production of human relaxin for therapeutic purposes. 
The invention also relates to the peptides human relaxin, prorelaxin and preprorelaxin, to the individual 
15 peptide chains which comprise these sequences and to modified forms of these peptides. 

The invention further relates to modified genes coding for the individual relaxin chains and for the 
above-mentioned modified forms. 

[Note: References used in the following description are collected at the end of the description.] 
Pioneering work by Hisaw (1926) suggested an important role for the peptide hormone relaxin in 
20 mammals through its effects in dilating the pubic symphysis, thus facilitating the birth process. Relaxin is 
synthesized and stored in the corpora lutea of ovaries during pregnancy and is released into the blood 
stream prior to parturition. The availability of ovaries has enabled the isolation and amino acid sequence 
determination of relaxin from pig (James et al, 1977; Schwabe et al, 1977) rat (John et al, 1981) and shark 
(Schwabe et al, 1982). The biologically active hormone consists of two peptide chains (known as the A and 
25 B chains) held together by disulphide bonds, two inter-chain and one intra-chain. The structure thus closely 
resembles insulin in the disposition of disulphide bonds which has led to speculation of a common ancestral 
gene for these hormones (James et al, 1977; Schwabe et al, 1977). 

Recombinant DNA techniques have been applied to the isolation of cDNA clones for both rat and 
porcine relaxins (Hudson et al, 1981; Haley et al, 1982), see also Australian Patent Application No. 11834/83 
30 (PF 2695/82) ( = EP-A-00866£9j^ Synthetic undecamer nucleotides, prepared on the basis of amino acid 
.sequence information, were used as primers for the synthesis of cDNA probes greatly enriched in relaxin 
cDNA sequences which identified relaxin cDNA clones in libraries derived from both rat and porcine ovarian 
tissue. The relaxin structural gene was found to code in both cases for a single chain precursor which 
resembles preproinsulin in the overall configuration, i.e., signal peptide/B chain/C peptide/A chain. 
35 - In our Application No. 17906/83 (EP-A-101309) we described the use of probes based on the C peptide 
region of porcine relaxin to select out a relaxin gene from a human genomic library. This approach resulted 
in the successful identification of a genomic clone which we have now designated "H1 n from which the 
structure of the entire coding region of a human preprorelaxin was determined. 

The present invention arises from a continuation of the work described in Application No. 17906/83 (EP- 
40 A-101309) in which we sought to confirm the gene structure described in that application. We have 
investigated cDNA clones in libraries derived from ovarian tissue from pregnant human females using as a 
probe a segment of the previously identified human H1 gene corresponding to approximately 300 
nucleotides of the C peptide/A-chain coding region (amino acids 64-161). Positive cDNA clones were 
isolated and sequencing of these revealed a cDNA sequence which was not identical with the sequence 
45 previously established and which coded for a form of preprorelaxin different to that described in our earlier 
application. 

We have also isolated from the human genomic library described in our copending Australian Patent 
application No. 17906/83 (PF 5352/82) ( = EP-A-0101309) a recombinant phage containing exon 1 of the H2 
gene where exon 1 comprises the coding region of. the signal, B-peptide, and part of the C-peptide similar 
so to that of the Hi-gene. 

It is now believed that either or both the presently-described gene which we have designated "H2" and 
the "H1" gene described in our Application No. 17906/83 ( = EP-A-0101309) expressed in human reproduc- 
tive tissue, for example ovary and placenta, and/or other tissues including but not limited to gut brain and 
skin, since both genes- express peptides with relaxin-like activity. 
55 The corpora lutea of the ovary as well as decidual and placental tissues are the most likely sites for 
' expression of relaxin-related genes. However, in view of the wide distribution of many peptide hormones it 
is highly likely that the relaxin gene is also expressed in non-reproductive tissues, including brain and the 
gastrointestinal tract. Relaxin has the general properties of a growth factor and is capable of altering the 
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nature of connective tissue and influencing smooth muscle contraction. We believe that one or both of the 
gene structures described in this specification and that of Application No. 17906/83 (EP-A-101309) to be 
widely distributed in the body, we suggest that the relaxin peptides expressed from these genes will play an 
important physiological role in addition to their well documented hormonal function during reproduction. 
The following abbreviations are used in this description. 

H1 - the relaxin gene described in Application No. 17906/83 (EP-A-0101309), being deduced from a 
genomic clone. 

H2 - the relaxin gene described herein, being deduced from a cDNA clone. 

DNA - deoxyribonucleic acid A Adenine 

RNA - ribonucleic acid T - Thymine 

cDNA - complementary DNA G - Guanine 

(enzymatically C - Cytosine 

synthesized U - Uracil 

from an mRNA sequence) 
mKNA - messenger RNA 

The coding relationships between nucleotide sequence in DNA and amino acid sequence in protein are 
collectively known as the genetic code, which is set out below. 
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The abbreviations used for the amino acids in the table 
are identified as follows. 
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Each 3-letter codon represented in the tab.e, e.g.. AUG. CAU ^ S %^TZ ^ The et's 
corresponds to a trinucleotide of mRNA, having a 5'-end on the left and a 3 -end on the nght The letters 
sSnd tor the purine or pyridine bases forming the nucleotide sequence. All DNA sequences g.ven he em 
Te lose ofTe strand whose sequence corresponds to the mRNA sequence, with thym.ne CO substituted 

^Te' 1 invention wiU be further described and e.ucidated in the following discussion. Reference will be 
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made to the accompanying drawings in which: 

Figure 1 : shows an abbreviated restriction map and sequencing strategy for the cDNA clone in 
pBR322, genomic clone H11, and GT10 cDNA clones a-f. Arrows indicate the direction of sequencing on 
end-labelled fragments (see methods). GT10 clones a-f were sequenced by subcfoning into an M13 vector 
-5 as described hereinafter. Nucleotides are numbered from the AUG initiation codon, position 1-3, to the 
termination codon, position 554-556. 

Figure 2 : compares the amino acid and mRNA sequence of human preprorelaxin H2 (upper) with the 
corresponding H1 (lower) sequence. The sequences have been aligned to maximize homology with 
nucleotide identities being indicated by asterisks and amino acid homologues by boxed-in areas. Amino 
w acids are numbered from the start of the B-chain (H2 gene sequence starting at -1 and H1 sequence at + 1) 
although this position represents only the hypothetical start of the B chain sequence and has been deduced 
simply from the homology to the related porcine arid rat preprorelaxin structures. The asterisk beneath Ala 
45 in the C peptide denotes the position of an intron in the G/CA codon in both genes. 

Figure 3: autoradiographs of identical nitrocellulose strips taken from either a) Northern gel transfer of 
rs human ovarian RNA or b) Xplaques corresponding to the H1 gene (XH7) or H2 gene (\GTlO-a) using as 
hybridization probes A: a random primed 600 bp H2 relaxin cDNA fragment (72-660), B: H2-specific 25 mer 
(483-507), C: H1 -specific 25 mer (483-507), D: H1 -specific 25 mer (248-272). 

Figure 4 autoradiographs of identical nitrocellulose strips following Northern gel transfer of human 
ovarian RNA using as hybridization probes fragments of the H2 cDNA clone in pBR322 (see Figure 1). 
20 A: 600 bp fragment (72-660) corresponding to most of the coding region 
B: 5' untranslated region (to Hinf I site at nucleotide 30) 
C: 3' untranslated region (from Hinf I site at nucleotide 660) 
D: 3* untranslated region (from Hpa I site at nucleotide 850) 
Figure 5: comparison of the amino acid sequences of the B and A chains between the two human 
25 relaxin genes, human insulin, and other members of the relaxin family. Boxed areas highlight residue which 
are conserved between the two human relaxin genes-and the relaxin family. Arrows indicate probable sites 
of proteolytic cleavage with confirmation by protein sequencing data of the amino terminal residue of the B 
and A chains of porcine (Schwabe et al, 1977; James et al, 1977), rat (John et al, 1981), shark (Schwabe et 
al, 1982) and dogfish relaxins (Schwabe et al, 1983). 
30 The H2 mRNA sequence 'shown in Figure 2 was. determined by the methods described hereinafter. For 
ease of comparison, the numbering of the amino acids previously used for the peptide derived from the H1 
sequence has been maintained in the present description of the H2-derived peptide. The structure of H1- 
preprorelaxin was deduced from the genomic sequence by comparison with the homologous structures of 
pig and rat relaxin. The H2-preprorelaxin structure was deduced by comparison with the H1 structure as 
35 well as the pig and rat structures. Confirmation of the A and B peptide chain structures has been provided 
by synthesis and chain recombination in vitro which produces a material which is biologically active in the 
uterine contraction assay. 

It will be seen from Figure 2 that the present and previous sequences show significant differences as 
well as similarities. Notable are: 
40 (1) Significant amino acid differences in three main areas: 

(a) the N-terminus of the B-chain 

(b) the N-terminus of the A-chain 

(c) the middle of the C-peptide, 

(2) Regions of strong homology in the B-chain and C-peptide: 
45 (a) 120 identical bases from Val 6 to lie 47 . 

(b) 88 - 90 identical bases from Phe 101 to Ser 132 . 
The two genes are therefore very similar but the differences are sufficient to indicate that the H2- gene 
is indeed a second gene and not simply a polymorph of H1. 

The mode of in vitro processing of the H1 preprorelaxin is not yet fully known but by analogy with pig 
so relaxin, cleavage oflhe signal peptide would be expected to occur at the Ala -1 -Lys 1 bond. Similarly 
excision of the H1-C peptide is predicted to occur at Leu 32 - Ser 33 and Arg 136 - Arg 137 , thus giving the Hi- 
fi and H1-A chains or respectively 32 and 24 residues (Figure 2). 

In H2 preprorelaxin Ala 1 has been replaced by Asp and so we would predict cleavage of the signal 
peptide after the alanine corresponding to position -2 in H1. Cleavage at the H2-B chain/C peptide junction 
55 ' is expected after Leu 32 by analogy to all other prorelaxins, thus leaving the H2-B chain with 33 residues. 
Cleavage at the H2-C peptide/A chain junction would occur after Arg 136 by analogy to rat preprorelaxin, 
thus leaving the H2-A chain with 24 residues. 

.As noted in our studies on pig relaxin, there are core sequences in the pig relaxin B and A chains which 
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contain all the essentia, elements for biological activity. Our synthetic studies on the human re.axin chain 

^ Thtinvtntion a,so includes any sub-unit of the H2-preprore.axin gene V^^^Z'^ 
any equivalent of the said sequence or sub-unit. Among the sub-un,ts to ^ ,n ^ 9 ^^ST^Z 
the individual structural genes coding for the signal peptide chain and the separate H2-A»d W£ P»J*£ 

of human prorelaxin. • HnuhiP-stranded DNA fragment for the 

-Those skilled in the art will appreciate that polymorphic lorms ot the genes may exist, auc 
»- ^^SStS^SS- the consents o, th. aPove seances. s ul ,un„s „ .orients, and 

Z»J^^^^*5SSi. ,s provide, a ON. tran 5 ,e, yeclor comp 

the deoxynucleotide sequences corresponding to the genes defined above. 

The transfer vectors of the invent.on may also inc » d « ^ h^ w example the cells or prokaryotic 

^ - -anster vectors commonly osed in Pacteria, 9 en.,ics am plasms anc I th . DNA o. certain 

S6 . bacteriophages. Bolh phage DNA and bactenal plasmrds have b . f J^ d J* ™ 9rr , p i oye d. The 



known in the art. 
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The invention also includes a prokaryotic or eukaryotic cell transformed by any of the transfer vectors 
described above. 

One preferred microorganism is the very familiar Escherichia coli , but any other suitable microorgan- 
ism may be used. 

5 According to a still further aspect of the present invention, there is provided a process for making a 

DNA transfer vector for use in maintaining and replicating a deoxynucleotide sequence coding for human 
preprorelaxin, characterised by ligating a deoxynucleotide sequence coding for human preprorelaxin with a 
DNA molcule prepared by cleaving a transfer vector with a restriction enzyme. 

DNA transfer vectors for use in maintaining and replicating deoxynucleotide sequences coding for 

w human prorelaxin and for the A and B chains of human relaxin may be similarly prepared from the 
appropriate deoxynucleotides. 

The A and B peptide chains, and also prorelaxin and preprorelaxin may be prepared by the usual 
process of gene expression, that is by growing cells containing the appropriate transformed transfer vector 
and isolating and purifying the required peptide(s) produced by the cells. 

75 Thus, the invention further includes a process for making a fusion protein comprising the amino acid 
sequence of human preprorelaxin as its C-terminal sequence and a portion of a prokaryotic or eukaryotic 
protein as its N-terminal sequence, characterised by incubating a cell culture transformed by an expression 
transfer vector comprising a deoxynucleotide sequence coding for human preprorelaxin, prepared in 
accordance with the process described above. 

20 Fusion proteins comprising the amino acid sequences for human prorelaxin and/or the A and/or B 
and/or C chains of human relaxin may be similarly prepared. 

The fusion peptide products thus produced will be in the form of a fusion protein in which the desired 
peptide is linked with a portion of a prokaryotic or eukaryotic protein characteristic of the host cell. Such 
fusion proteins also form a part of this invention. 

25 The invention also includes a process for synthesizing human prorelaxin comprising the A and B 
peptides separated from each other by a C peptide, characterised by incubating a culture of cells, 
transformed by an expression transfer vector comprising a deoxynucleotide sequence coding for said 
human prorelaxin, prepared as described above, under conditions suitable for expression of said sequence 
coding for human prorelaxin, a^d purifying human prorelaxin from the lysate or culture medium of said 

30 cells. 

' The peptide of interest can be recovered from the fusion product by any suitable known cleavage 
procedure. 

As already indicated above the transfer vector may be modified by codon substitution /deletion/addition 
and'such modifications will give rise to modified fusion peptides. In this way appropriate modifications may 
35 be made to facilitate the cleavage of the fusion peptides, for example, at the junction of B/C or C/A chains 
or to modify the peptide chain behaviour during subsequent chemical or biological processing. 
As indicated above, the invention also provides human relaxin, prorelaxin and preprorelaxin. 
Relaxin may be prepared by direct combination of the separate A and B chains by any of the 
procedures currently known and used for the preparation of insulin. 
40 Also in a similar manner to insulin, relaxin may be prepared from prorelaxin by oxidizing or otherwise 
converting the sulfhydryl groups on the A and B peptides of relaxin, prepared as described herein, to form 
disulfide crosslinks between said A and B peptides, and then excising the C peptides, for example, by an 
enzyme-catalyzed hydrolysis specific for the bonds joining the C peptide to the A and B peptides. 

Accordingly, the present invention further provides a method for the synthesis of human relaxin which 
45 comprises combining the A and B chains of relaxin (in their full-length, shortened or modified forms) by 
methods known per se for combination of A and B chains of human insulin. 

- 6ne such methocTcom prises reducing a mixture of the S-sulphonated A and B chains and then allowing 
the mixture to oxidize in air. 

We have also found that the efficiency of the above procedure is improved when one or both of the A 
so and B chains is in the form of an S-thioethyl-cys derivative rather than the S-sulpho form. 

In our Australian Patent Application No. 15413/83 (PF 4385/82) we also showed that one or both of the 
A and B chains of relaxin can be shortened at the amino and/or carboxy terminii without significant loss of 
biological activity and with improved combination yields. These techniques apply equally to the preparation 
of human relaxin. 

55 Another aspect of the invention provides a human relaxin* analogue consisting essentially of shortened 
and/or modified forms of the natural B and/or A peptide chains. 

-This aspect of the invention also provides a method for producing a human relaxin analogue which 
comprises the step of forming the shortened and/or modified B and/or A peptide chains and combining 
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them by any of the methods described above. These latter aspects are dealt with in the Divisional 
application nr. 881 10103 published as EP-A-0303033. 

Our investigations with both pig and human relaxin (H1) show that relaxin activity may be present with 
human A chains as short as A(10-24) and B chains as short as B(l0-22) although the expected practical 
minima are respectively A<4-24) and B(4-23). The synthetic pig peptide A(4-24)-B<1-25) is already known to 

have relaxin activity. OA . . 

In general, for the present relaxin structure (H2) the A chain can be varied from A(1-24) to A(10-24) and 

B chain from B(-1-32) to B(10-22). 

The preferred combinations are derived from: 

A b 

(1-24) (-1-23 ) 

any of (2-24) with any of (up to) 

(3-24) ("I- 32 > 



20 Modifications of the B and/or A chains, in accordance with the present invention may involve e.ther 
"genetic" modification, as described above, or chemical modification of the B and/or A chains (in e.ther full- 
length or shortened form) prior to combination by the method of the invention. Two types of modification 
may be employed, either singly or in combination. 

The first type involves the modification of one or more of the amino-acids which occur .n the natural or 

25 shortened B and/or A chains. Such modification will generally involve protection of active groups on one or 
more of the amino-acids by methods known per se , and the protecting groups may, if desired, be removed 
after combination of the (modified) A and B chains. 

Examples of this type of modification include the acetylation, formylation or similar protection of free 
amino groups, including the^-terminal, amidation of C-terminal groups, or the formation of esters of 

3° hydroxy! or carboxylic groi/ps." The formyl group is a typical example of a readily-removable protecting 

9r0U The second type of modification includes replacement of one or more of the natural amino-acids in the 
B and/or A chains with a different amino acid (including the D-form of a natural amino-acid). This genera 
type of modification may also involve the deletion of a natural amino-acid from the chain or the addition of 
35 one or more extra amino-acids to the chain. 

The purpose of such modifications is to enhance the combination yields of the A and B chains. whHe 
maintaining the activity of the product, i.e., relaxin or an analogue thereof, or to enhance or modify the 
activity of the product for a given combination yield. Such modification may extend to the production of 
svnthetic analogues which have relaxin-blocking or -antagonistic effects. 
«° A specific example of the first type of modification is the modification of the tryptophan (Trp) residue at 
B2 by addition of a formyl group. nnA 

Examples of the second type of modification are replacement of the Met moiety at B24 with norleuc.ne 
(Nle) valine (Val), alanine (Ala), glycine (Gly). serine (Ser) or homoserine (HomoSer). 

The invention in this aspect also includes human relaxin analogues formed from natural or shortened B 
45 and/or A chains modified in accordance with the invention as described above. 

The A and B peptide chains, and also prorelaxin and preprorelaxin may be prepared by the usual 
process of gene expression, that is by growing a microorganism containing the appropriate transformed 
transfer vector and isolating and purifying the required peptide(s) produced by the microorganism. 

The peptide products thus produced may be in the form of a fusion protein in which the desired peptide 
50 is linked with a portion of a prokaryotic protein 

The invention is further described and illustrated by the following description of the experimental 
procedures used and the results obtained thereby. 



65 Methods and Materials 



Messenger RNA isolation and cDNA cloning 
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Human ovarian tissue obtained during surgery for the treatment of an ectopic pregnancy was quickly 
frozen on dry ice and the RNA + isolated in 5M guanidinium thiocyanate (Merck) according to the method of 
Chirgwin et al 1979. Poly-A* RNA was converted into double stranded DNA (Wickers et al, 1978) and 
cloned either by the homopolymeric G/C tailing method into a pBR322 plasmid vector (Chang et al.. 1978) 
or by the lambda packaging method using the XGT10 vector (Huynh et al., 1983). In our experience the 
efficiency of transformation with the pBR322 method (10* recombinants/ug of cDNA) was far less efficient 
than the lambda technique (up to 10 6 recombinants/ug of cDNA). 



w Preparation of hybridization probes 




Radiolabeled probes were prepared by primed synthesis on various DNA fragments using denatured 
random primers of calf thymus DNA (Hudson et al. ( 1983, Taylor et al., 1976). The DNA template (100-200 
ng) was denatured with the random primers (1 ug) by boiling in 20 ul of water for 2 minutes. Synthesis was 
is initiated by the addition of a 30 ul reaction mixture containing 50mM Tris-HCI pH 8.0, 50- NaCI, 1mM DTT, 
10mM MgCb, 5 units of E. coli DNA Polymerase 1 (Klenow fragment), 500 uM each of dCTP, dGTP, dTTP 
and 0.3 uM a-[ 32 P]-dATP (Approx. 3000 Ci/mmol, Amersham). After incubation at 37° C for 30 minutes the 
reaction was terminated by dilution into 300 ul of a buffer containing 0.3M NaCI, 10mM Tris-HCI, pH 8.0, 
1mM EDTA and passed through a Sephadex-G50 column, (1cm x 5cm) in the same buffer. The 
radiolabelled probe was collected from the peak fractions at void volume and precipitated with 2 volumes of 
ethanol at -20° C for 2 hours using tRNA (10 ug) as carrier. 

Selection of specific cDNA clones 

To screen the human ovarian cDNA clone bank for relaxin specific sequences we used as a probe a 
segment of the previously identified human H1 gene corresponding to a 400 nucleotide segment coding for 
the C peptide and A-chain from amino acid 64, through the termination codon and including 80 bases of the 
3' untranslated region. A single positive cDNA clone from the pBR322 library was isolated and sequenced. 
30 23 unique recombinants ^tere isolated from the XGT10 libraries, but of these only 6 were subjected to 
complete nucleotide sequence analysis. 



25 
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DNA sequence analysts 

The sequencing strategy and an abbreviated restriction map of the cDNA clones are summarized in 
Figure 1. The recombinant plasmid in pBR322 was digested with restriction enzymes Hpa II (P), Hinf I (F) or 
Taq I (T) and end-labelled using reverse transcriptase and the appropriate a-labelled deoxynucleotide 
triphosphate (dCTP for Hpa II, and Taq I, dATP for Hinf I). Fragments were cleaved internally with a second 
restriction endonuclease and then separated by electrophoresis on 8% polyacrylamide gels prior to 
sequencing by the chemical degradation method of Maxam and Gilbert et al, 1977. 

cDNA clones in'XGTIO were -sequenced by subcloning Eco R1 restriction fragments into M13mp9 and 
employing the techniques described by Sanger et al, (1980). 



45 



Southern and Northern gel analyses 



Performed on purified genomic DNA after restriction endonuclease cleavage by the method of Southern 
(1975) or on purified RNA. The DNA fragments which were used as probes were found to be specific for 

so either exon I or exon II of the H1 genomic clone despite having a small amount of flanking sequences. 
These fragments were generated by subcloning into M13mp8 a 500 bp Alu I fragment of the XH7 clone in 
the case of the exon I probe, or a 400bp.Eco Rl-Ava II fragment for the exon II probe. A probe from the H2 
cDNA clone was generated by digesting with Hinf I and isolating a 300 bp doublet corresponding to the 
coding region from Asp 1 to the termination codon and including 110 bases of the 3* untranslated region 

-55 (Figure 1). Oligonucleotide probes were synthesized by the phosphite chemistry method of Beaucage and 
Caruthers (1981) and were end-labelled with 7 - 32 P-ATP using T4 polynucleotide kinase. Hybridization 
conditions were calculated on the basis of the G + C content. 
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Isolation and nucleotide sequence analysis of the H2 genomic clone 



The human genomic lambda library of Lawn et at (1978) was screened by method described earlier 
(Hudson et al, 1983) except that a mixture of DNA fragments corresponding to exons I and II of the H1 

5 genomic clone was used for the probe as described above. Positive phage were grown in litre scale liquid 
cultures, the DNA isolated and digested with restriction endonucleases prior to mapping with the exon I and 
II probes. A 4 kilobase EcoRI fragment was found to contain the entire exon I coding region which 
differentiated this clone from the homologous H1 gene structure. This fragment was subcloned into 
M13mp8 and sequenced by the technique of Maxam and Gilbert (1977). After digesting with Ava I, 

w fragments spanning the coding region were end-labelled and cleaved internally by a second restriction 
enzyme (Hpa II of Hinf I) to generate fragments suitable for sequence analysis. 

Isolation of a cDNA clone 

15 

Samples of human corpus luteum were made available to us as a result of surgical intervention in 
ectopic pregnancies or from lutectomy at the time of Caesarian section. From the RNA isolated from a 
single corpus luteum a cDNA library was constructed in pBR322 providing about 300 unique recombinants. 
Screening this library with an H1-cDNA probe revealed a single recombinant with sequence homology to 

20 human relaxin I. To increase the total number of recombinants from such small amounts of ovarian tissue 
we constructed cDNA libraries using the XGT10 cloning system (Huynh et al , 1983). Screening with a 
relaxin-specific probe identified 23 unique cDNA clones of which six were characterized as shown in Figure 
1. Nucleotide sequence analysis revealed that all 6 cDNA recombinants encoded fragments of the. same 
relaxin structural gene (Figure 2), yet this sequence was different to the genomic clone reported earlier 

25 (Hudson et al., 1983). We expected that this novel sequence corresponded to the second human relaxin 
gene (H2) which had been observed in genomic DNA. 

Surprisingly, none of the cDNA clones contained a polyadenosine sequence at the 3 1 end, although the 
size of cDNA clones in PBR322 and XGT10 (1800 bp and 1900 bp respectively) indicate that large 
transcription products wene being synthesized during the cloning procedure. These two cDNA clones had 

30 overlapping sequence identity at the 3' terminus confirming that they were derived from the same mRNA 
structure. We attributed tTie loss of the poly-A tail either to premature termination of the double-stranding 
transcription reaction or to excessive S1 nuclease degradation during the cloning procedure. 



35 Isolation of a genomic clone corresponding to the second gene 



A thorough screen of 10 8 recombinant phage from the human genomic library of Lawn et. al., (1978) 
using mixed probes specific for exon I or II of the XH7 relaxin clone revealed 16 positive phage. Small scale 
restriction mapping analysis revealed that 14 of these recombinant phage corresponded to the H1 relaxin 

40 gene reported earlier (11 were identical to the XH7 genomic clone; 3 were identical to XH5 a different 
genomic xlone of the H1 gene as previously reported by Hudson et. al., 1983). However, the other 2 
recombinant phage were identical and had a unique restriction pattern characteristic of the H2 relaxin gene 
whose structure is given in Figure 1. The unusual ratio of recombinants reflects either their proportion in the 
original genomic library or results from selective growth during amplification. Southern blot analyses of this 

45 new recombinant phage (XH11) using separate probes corresponding to either exon I or II of the XH7 clone, 
revealed that XH11 contained only the exon I coding region. Attempts to find a full length genomic clone 
corresponding to the H2 relaxin gene either in the library of Lawn et. al. (1978) or in another library (Dr. R. 
Crawford, unpublished) have so far been unsuccessful. 

The nucleotide sequence of the relaxin coding region of XH11 was found to be identical to that 

so observed in the cDNA clone shown in Figure 2. An intron interrupts the coding region in exactly the same 
position as in the XH7 genomic clone (Hudson et. al., 1983) suggesting that these genes arose by a gene 
duplication event at some point in evolution. 

• 55 Northern gel analysis 

RNA was isolated from several samples of human corpora lutea taken from different individuals during 
surgical intervention for ectopic pregnancy or during Caesarian section operations. Northern gel analysis 
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using probes made from the coding region of either relaxin gene revealed that two major mRNA species of 
approximate sizes 1000 bp and 2000 bp were present in five human ovarian RNA samples tested (Figure 
3). The smaller mRNA species were 2-3 fold more abundant in the RNA samples tested and this result was 
independent of whether the probe used in the analysis corresponded to H1 or H2 relaxin indicating that high 
cross-hydridization rates occur under our experimental conditions. To differentiate whether these two mRNA 
species represent the separate products of the H1 and H2 genes, oligonucleotide probes were synthesized 
over a region of minimum homology (60%) between the two relaxin genes (residues 137-144 in Figure 2). 
These synthetic 25 mers were radiolabeiled by kinasing with T -3 2 P-ATP and used as hybridization probes 
under conditions shown to provide specificity for either the H1 or H2 gene (Figure 3). Northern gel analysis 
using these radiolabeiled probes revealed that both mRNA species corresponded to products of the H2 
gene. We could not detect any transcription products from the H1 gene using the specific probes, although 
low level expression (less than 5% of the H2 level) would have been difficult to identify. 

To analyse the different mRNA transcripts from the H2 gene, we made specific probes from segments 
of the two large H2 cDNA clones corresponding to the coding region and 5' and 3* untranslated regions 
(Figure 4). The larger mRNA transcript (approximately 2kb in length) selectively hybridized to segments of 
the 3' untranslated region from both cDNA clones, from a position approximately 100 bases from the 
termination codon. A potential polyadenylation signal exists in the nucleotide sequence of the cDNA clones, 
140 bases from the termination codon, and this region does have homology to the porcine relaxin 
polyadenylation site. However, the question of whether the shorter mRNA product is polyadenylated near 
this position cannot be resolved until full length cDNA clones corresponding to both mRNA forms have 
been isolated and characterised. 

- In the absence of the genomic sequence of the H2 gene it is impossible to define the mechanisms 
leading to the formation of the two mRNA transcripts. It is possible, like the collagen and /S-microglobuiin 
genes, that cleavage of the primary RNA transcript could occur at alternative polyadenylation sites. On the 
other hand we cannot rule out the possibility of alternative splicing mechanisms such as occurs in the 
calcitonin, growth hormone and a-crystallin genes. 

The primary structure^q Hreproreiaxin encoded by the H2 gene 



The mode of in vivo processing of the human preprorelaxin genes is not yet fully understood and has to 
be deduced by analogy to the processing of porcine and rat preprorelaxins (Figure 5). The predicted B and 
A chain structures for the H1 and H2 genes have been aligned to other members of the relaxin family and 
human insulin in Figure 5. 

Cleavage of the signal peptide in H1. has been predicted (Hudson et al., 1983) to occur after a short 
side chain residue such as Ala" 1 , -2 or -4 or after Ser-6. Cleavage after Ala-1 is consistent with the 
homology to porcine preprorelaxin and human preproinsulin. Similarly, cleavage of the H2 signal peptide 
probably occurs after Ala-2 by such analogy, although cleavage after Ala-4 or Ser-6 are other possibilities. 

By analogy to rat and pig prorelaxins, cleavage at the B chain/C peptide junction would occur after Leu 
32 in both H1 and H2 precursors. However, both human relaxin B chains possess at positions 29-30 the 
conserved dibasic sequence Lys-Arg, which is a known processing site in other prohormones such as 
proinsulin, and cleavage here cannot be excluded. Direct amino acid sequence analysis of relaxin isolated 
from corpora lutea of pregnancy will be required to settle this point. In the meantime it. seems that the most 
likely structure of the H1 B chain would be 32 residues in length (Lys 1 to Leu 32) and the H2 B chain 
45 would be 33 residues (Asp-I to Leu 32). 

Cleavage at the C peptide/A chain junction of H1 prorelaxin has been predicted (Hudson et al., 1983) to 
occur after Arg 136 within a group of 4 basic residues because the Arg-Pro imide bond at 137-138 would 
be resistant to proteolysis. H2 prorelaxin has the same sequence of 4 basic residues and a similar 
processing step after Arg 136 would result in both the H1 and H2 relaxin A chains being 24 residues in 
so length. 

Biological Activity of the H2 gene 

As noted in earlier studies on synthetic pig relaxin peptides, there are core sequences in the pig relaxin 
B and A chains which contain all the essential elements for biological activity. Our synthetic studies on the 
H1 relaxin peptides has shown that combination of the complete H1 A chain (Arg 137-CYS 160) to a 
shortened form the H1 B chain (Lys 1-Ser 25) produced material which possessed biological activity 
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(Hudson et al. ( 1983). Further studies on both the H1 and H2 gene structures using peptide synthesis 
reveals that both genes code for forms of relaxin which are biologically active in the rat uterine contractility 
assay. 

5 

Chemical Synthesis of a modified human relaxin H2 (hRLX) A(1-24) - B(-1-24) 

(i) Synthesis of human relaxin A-chain, H2 hRLX A(1-24) 

The amino acid sequence corresponding to residues 1 to 24 of the human relaxin A-chain, deduced 
io as described above from the nucleotide sequence of the cDNA clone, was synthesized by the solid- 
phase procedure according to the general principles described by Merrifield (e.g. Barany, G. and 
Merrifield, R.B. In "The Peptides". Ed. E. Gross & J. Meienhofer, Academic Press, N.Y., pp. 1-284, 
1980). 

N-a-tertiarybutyloxycarbonyf -4-methylbenzyl-L-cysteine Thereinafter "BOC") was coupled to a 1% 

15 crossiinked polystyrene resin via the phenylacetamidomethyl (PAM) linkage to a level of 0.30 mmole/gm 
using the method of Tarn et aU (Synthesis 12 , 955-957, 1979). The BOC-L-CYS-PAM resin (8.0 gm) 
was transferred to the reaction vessel of a Beckman Model 990 Peptide Synthesizer and the amino acid 
sequence from residues 23 through to 1 was assembled by the stepwise addition of each suitably 
protected amino acid. The amino terminal BOC protecting group of each amino acid was removed by 

20 treatment of the resin with 35% trifluoroacetic acid in methylene chloride for 30 minutes followed by 
neutralization with 5% diisopropytethylamine in methylene chloride for 15 minutes. After each treatment 
the resin was washed thoroughly with methylene chloride. The next amino acid in the sequence 
(suitably-protected at the a-amino with the BOC group and where necessary with the side-chain 
functional group appropriately protected) wad coupled to the resin using dicyclohexylcarbodiimide 

25 (DCC). The resin was stirred with the amino acid in methylene chloride for 10 minutes prior to the 
introduction of the DCC which was also dissolved in methylene chloride. A 2.5 molar excess (6.0 mmole) 
of amino acid and DCC was used for each coupling. After stirring for 1 hour a sample of the resin was 
removed from the reaction mixture and tested for the presence of free amino groups using the ninhydrin 
procedure of Kaiser et ^l. (Anal. Biochem., 34 , 595-598, 1970). If the ninhydrin test was negative 

30 indicating complete couplftirg the reaction cycle~was continued with BOC deprotection, neutralization and 
coupling of the next amfrio acid. For a positive ninhydrin test the coupling reaction was repeated with 
further amino acid and DCC. 

Amino acids with side-chain functional groups were used as the following protected derivatives: N-a- 
— BOC-2,6-dichlorobenzyl-L-tyrosine, N-a-BOC-t -chlorobenzyloxycarbonyl-L-lysine; N-ot-BOC-L-serine O- 

35 benzyl ether; N-a-amyloxycarbonyl-N G -tosyl-L-arginine; N-a-BOC-L-threonine O-benzyl ether; N-a-BOC- 
S-ethyl mercapto-L-cysteine (for CYS at A-chain sequence position 15, 11 and 10). 

Following the assembly of the 1-24 peptide sequence, the final BOC group on the amino terminal 
arginine was removed using the deprotectdion neutralization cycle and the peptide-resin dried in vacuo 
(wt of peptide resin 13.0 gm). A portion of the peptide-resin (2 gm) was treated with anhydrous hydrogen 

40 fluoride in the presence of anisole (2 ml) at 0* C for 30 minutes. The total time for contact of the resin- 
peptide with hydrogen fluoride (HF) was kept to a minimum (not more than 70 minutes) by rapid removal 
of the HF under oil-pump vacuum. The resin-peptide was then washed several times with ethyl acetate to 
remove excels anisole, the peptide extracted into 1 M acetic acid and the solution lyophilized. The yield 
of crude peptide, (with the cysteines at positions 10, 11 and 15 still protected as the S-thioethyl 

45 derivative) was 392 mg. Initial purification of the crude peptide was by gel-filtration on Biogel P10 in 
0.1 M acetic acid. The fractions representing the major -peak from this column, which eluted at a position 
corresponding to a molecular weight of approximately 3000, were collected and lyophilized. Amino acid 
analysis of a sample of this peptide indicated that ail the amino acids of the 1-24 sequence were present 
in the correct ratio. 

so Further purification of the [S-thioethyl Cys 10 l1l5 ]-hRLX A. (1-24) peptide was affected by preparative 

reverse-phase HPLC on a Waters C-18 Bondapak column using a 0.1% TFA-water/acetonitrile solvent 
system. 

A sample (80 mg) of the peptide purified by gel-filtration was S-suifonated with a mixture of sodium 
sulfite and sodium tetrathionate (total reaction time of 3 hours) according to the method described by Du 
55 et al., (Scientia Sinica, 101, 84-104 (1961)). The precipitate which formed during the S-sulfonation 
reaction was removed by filtration and both the precipitate and the supernatant solution dialyzed against 
distilled water at 4* C for 48 hours. The contents of the dialysis bags were lyophilized to yield 39.5 mg of 
peptide from the supernatant solution and 20.3 mg of peptide from the precipitate which occurred during 
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the S-sulfonation reaction. A sample of the 'soluble 1 [S-sulfo Cys 10 - 11 - 15 - 24 ] hRLX A(1-24) peptide was 
purified by preparative reverse-phase HPLC on a Waters C-18 Bondapak column using a 0.1% TFA- 
water/acetonitrile solvent system. 

(ii) Synthesis of shortened human relaxin B-chain, H2 hRLX B(-1-24) 

5 The amino" acid sequence corresponding to relTdues -1 to 24 of the H2 human relaxin B-chain was 

synthesized using the procedures described above and commencing with 6.0 gm N-a- 
tertiarybutyloxycarbonyl-L-methionine-O-benzyl-L-serine-phenylacetamido-methyl polystyrene resin with 
a loading of 0.5 mmole Met per gm. The side-chain protecting groups used in the A-chain synthesis 
were also employed for the B-chain including the S-ethyl mercapto derivative for both cysteines at 

70 positions 10 and 22. The glutamic acid residues at positions 4 and 5 and the aspartic acid residue at -1 
were added as the N-a-BOC-benzyl ester derivative. The glutamine at position 18 was coupled by the 
active ester procedure using N-a-BOC-L-glutamine-p-nitrophenyl ester in DMF. Following coupling or the 
tryptophan at position 2, 0.1% indole was added to the trifluoroacetic acid deprotecting reagent and to 
the subsequent methylene chloride washes. 

75 The final weight of peptide-resin after removal of the BOC group from the amino terminal aspartic 

acid residue and vacuum-drying was 8.5 gm. A portion of the peptide resin (3.5 gm) was treated with 
anhydrous hydrogen fluoride in the presence of anisole (2 ml)^at 0°C for 30 minutes and the B-chain 
peptide isolated using the procedure described above for the A-chain. The crude [S-thioethyl Cys 10,22 ] 
. hRLX B(-1-24) (0.97 gm) was purified by gel filtration on BioGel P10 in 1M acetic acid followed by 

20 preparative HPLC. 

A sample (100 mg) of the peptide purified by gel filtration was S-sulfonated at pH 8.3 for 3 hours, the 
reaction mixture filtered and the precipitate and supernatant solutions dialyzed against distilled water. 
The 'soluble' peptide recovered after lyophilization was 42.4 mg; the 'insoluble* peptide was 59.5 mg. 
The S-suffonated B-chain peptides were further purified by preparative HPLC using a C-18 reverse- 

25 phase column and 0.1% TFA-water-acetonitrile solvent system. 

(iii) Chain Combination 

The synthetic H2 hRLX A(1-24) and H2 hRLX B(-1-24) peptides were combined using the procedure 
described by Chance and Hoffmann (Australian Patent Application No. 68844/81) for insulin chains 
wherein the S-sulfonated peptides were mixed in a ratio of A : B of 2.6: 1 at a peptide concentration of 
30 10 mg/ml in glycine buff^r^pH 10.5. Dithiothreitol in glycine buffer was then added in an amount to give a 
total of 1.0 sulfhydryl groups for each S-sulfo group. The reaction mixture was then stirred in an open 
vessel for 24 hours. 

As a further modification to this procedure we have found that the chain combination reaction to form 
r- biologically active relaxin proceeded efficiently when one or preferably both of the peptide chains are 

35 used as their S-thioethyl-Cys derivatives rather than in the S-sulfo form specified by Chance and 
Hoffmann (op.cit.) in the case of insulin. The use of S-thioethyl Cys peptides eliminates a reaction and 
purification step required to convert the peptides to the S-sulfo derivatives. In our experience the S- 
sulfonation reaction of relaxin peptides is accompanied by side reactions which render the S-sulfo 
peptides difficult to purify resulting in low yields. 

40 Using the above conditions chain combination yields from 1.5 to 6.0% have been achieved as 

measured by biological activity in the rat uterine contractility assay of Wiqvist & Paul (Acta Endocrinol., 
29, 135-136, 1958). 



45 Example of Chain Combination Reaction 

Human relaxin H2 [S-thioethyl Cys 10 ' 11 ' 15 ] A(1-24) (4.2 mg dry wt., 2.4 mg peptide by amino acid 
analysis, 0.84 umole) was dissolved in 500 ul of 0.1 M glycine buffer pH 10.5 in a 3 ml stoppered plastic 
centrifuge tube. Human relaxin H2 [S-sulfo Cys 10 * 11 ] B(-1-24) (1.60mg, 1.60 mg peptide by amino acid 

so analysis, 0.33 umole) dissolved in 200 Ul of 0.1 M glycine buffer pH 10.5 was added and the mixture 
agitated. An aliquot (23.0 ul, 2.21 umole DTT) of a stock solution of dithithreitol (DTT) made up in 0.1 M 
glycine buffer pH 10.5 (0.96 umole DTT in 10 ul) was added to the peptide solution and following a brief 
agitation the reaction mixture was allowed to stand at 4° C for 24 hours open to the air. The mixture was 
then centrifuged and aliquots of the supernatant solution tested for relaxin biological activity in the rat 

55 . uterine contractility assay. Aliquots of the reaction mixture .inhibited the spontaneous contractions of the rat 
uterus in a dose-related manner. A 75 ul aliquot completely inhibited uterine contractions equivalent to a 
chain combination yield of 5.3% as compared to a native pig relaxin A22 B31 standard. 
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Synthesis of authentic human relaxin H2 : hRLX A(1-24) - B(-1-32) 

(i) Synthesis of full length H2 human relaxin B^chain JRLX B£32) reiaxin 

The ami no icld sequence corresponding to residues -1 to +32 of tne tun ,en 9< n . N 

B-chain was synthesised using the procedures described above and commencng N a 

5 l^Z^Z^,^ pheny, acetamido ^Pj^JJ, *Jfi3£ -e 

mmo. Leu per gm. The side-chain protecting ^s vsec for both cysteines at 

STt^ — of boc - l - 

'° of ^5S^^5^ assemb,y was 8.2gm. A portion of the pepti.e 

r esin T ^^ 

sr rrr^srsi ^=^^0^ «* w ere 

collected and lyophilised. neDtide by preparative HPLC 

gel chromatography was used directly in chain combination experiments, 
(ii) Chain combination of A(1-24) with B(-1 - 32) : 

predion of human relaxin H£ thirtethu , H 2 human relaxin A(1-24) peptides were coupled to S- 

in diverse forms thereof. 
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Claims 

1. A DNA fragment encoding human H2-preprorelaxin, said- H2-preprorelaxin having the amino acid 
sequence set out in Figure 2. 

2. -A DNA fragment encoding human H2-prorelaxin, said H2-prorelaxin having the amino acid sequence 

set out in Figure 2 with the exception that the signal sequence is excluded. 

3. A DNA fragment encoding a polypeptide having human H2-relaxin activity, said polypeptide having an 
A-chain and a B-chaifi Comprising the following amino acid sequences: 

A-Chain 

Gin Leu Tyr Set Ala Leu Ala A$n Lys Cys Cys His Val Gly 
Cys Thr Lys Arg Ser Leu Ala Arg Phe Cys 

B-Chain 

Asp Ser Trp Met Glu Glu Val lie Lys Leu Cys Gly Arg Glu 
Leu val Arg Ala Gin He Ala He Cys Gly Met Ser Thr Trp 
Ser Lys Arg Ser Leu* 



4. A DNA fragment encoding the signal A, B or C peptide chains of human H2-relaxin or any combination 
of two or more of said chains; wherein said peptide chains have the following amino acid sequences: 
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Signal Peptide 

Met Pro Arg Leu Phe Phe Phe His Leu Leu Gly Val Cys Leu 
Leu Leu Asn Gin Phe Ser Arg Ala Val Ala 

A-Chain 

Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His val Gly 
Cys Thr Lys Arg Ser Leu Ala Arg Phe Cys 

B-Chaln 

Asp Ser Trp Met Glu Glu Val lie Lys Leu Cys Gly Arg 
Leu Val Arg Ala Gin He Ala He Cys Gly Met Ser Thr 
Ser Lys Arg Ser Leu 



C-Chain 



Ser 


Gin 


Glu 


Asp 


Ala 


Pro 


Gin 


Thr 


Pro 


Arg 


Pro 


Val 


Ala 


Glu 


lie 


Val 


Pro 


Ser 


Phe 


He 


Asn 


Lys 


Asp 


Thr 


Glu 


Thr 


He 


Asn 


Met 


Met 


Ser 


Glu 


Phe 


Val 


Ala 


Asn 


Leu 


Pro 


Gin 


Glu 


Leu 


Lys 


Leu 


Thr 


Leu 


Ser 


Glu 


Met 


Gin 


Pro 


Ala 


Leu 


Pro 


Gin 


Leu 


Gin 


Gin 


Mis 


Val 


Pro 


Val 


Leu 


Lys 


Asp 


Ser 


Ser 


Leu 


Leu 


Phe 


Glu 


Glu 


Phe 


Lys 




Leu 


He 


Arg 


Asn 


Arg 


Gin 


Ser 


Glu 


Ala 


Ala 



Asp Ser Ser Pro Ser Glu Leu Lys Tyr Leu Gly Leu Asp Thr 
His Ser Arg Lys Lys Arg, 



A double-stranded DNA fragment encoding human H2-preprorelaxin, characterized in that it comprises 
a coding strand and a complementary strand corresponding to the following complete mRNA sequence: 
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AUG 


ecu 


CGC 


CUG 


UUU 


UUU 


UUC 


GAC 


CUG 


CUA 


<arV»A 


uuu 


fTfifT 
VW 


TUTA 


CUA 


CUG 


AAC 


CAA 


UUU 


UCC 


AGA 


GCA 


GUC 


GCG 


GAC 


UCA 




Artn 

AUVj 


GAG 


GAA 


GUU 


AUU 


AAA 


UUA 


UGC 


GGC 


CGC 




UUA 




UOU 




CAG 


AUU 


GCC 


AUU 


UGC 


GGC 


AUG 


AGC 


ACC 


uGG 


AGU 


& A A 
AAA 




iir*iT 

\JK-\J 


CUG 


AGC 


CAG 


GAA 


GAU 


GCU 


ecu 


CAG 




LLU 




UUA 






GAA 


AUU 


GUG 


CCA 


UCC 


uuc 


AUC 




jv j\ r. 
AAA 








ACC 


AUA 


AAU 


AUG 


AUG 


UCA 


GAA 


UUU 


<*uu 






U UV» 




C*AG 


GAG 


CUG 


AAG 


UUA 


ACC 


CUG 


UCU 


GAG 


AUG 


CAG 


LtA 


VjU A 


UUA 


UUA 




CUA 


CAA 


CAA 


CAU 


GUA 


ecu 


GUA 


UUA 


AAA 




ecu 


AV?U 


PHI! 


cue 


UUU 


GAA 


GAA 


UuU 


AAG 


> 

AAA 


/-»t iff 

cuu 


& f ff T 


U«U 


21 ATT 






AGU 


GAA 


GCC 


GCA 


GAC 


AGC 


AGU 


ecu 


UCA 


GAA 


UUA 


AAA 


UAC 


UUA 


GGC 


UUG 


GAU 


ACU 


CAU 


ucu 


CGA 


AAA 


AAG 


AGA 


CAA 


cue 


UAC 


AGU 


GCA 


UUG 


GCU 


AAU 


AAA 


UGU 


UGC 


CAU 


GUU 


GGU 


UGU 


ACQ 


AAA 


AGA 


UCU 


CUU 


GCU 


AGA 


UUU 


UGC 


UGA 























A double-stranded DNA fragment encoding human H2-prorelaxin, characterized in that it comprises a 
coding strand and a complementary strand corresponding to the following mRNA sequence: 



GAC 


UCA 


UGG 


AUG 


GAG 


GAA 


GUU 


AUU 


AAA 


UUA 


UGC 


GGC 


CGC 


GAA 


UUA 


GUU 


CGC 


GCG 


CAG 


AUU 


GCC 


AUU 


UGC 


GGC 


AUG 


AGC 


ACC 


UGG 


AGC 


AAA 


AGG 


UCU 


CUG 


AGC 


CAG 


GAA 


GAU 


GCU 


ecu 


CAG 


ACA 


ecu 


AGA 


CCA 


GUG 


GCA 


GAA 


AUU 


GUG 


CCA 


UCC 


UUC 


AUC 


AAC 


AAA 


GAU 


ACA 


GAA 


ACC 


AUA 


AAU 


AUG 


AUG 


UCA 


GAA 


UUU 


GUU 


GCU 


AAU 


UUG 


CCA 


CAG 


GAG 


CUG 


AAG 


UUA 


ACC 


CUG 


UCU 


GAG 


AUG 


CAG 


CCA 


GCA 


UUA 


CCA 


**■ " 


CUA 


CAA 


CAA 


CAU 


GUA 


ecu 


GUA 


UUA 


AAA 


GAU 


UCC 


AGU 


CU- 


Z'ZC 


UUU 


GAA 


GAA 


UUU 


AAG 


7iAA 


CUU 


AUU 


CGC 


AAU 


AGA 


CAA 


AGU 




GCC 


GCA 


GAC 


AGC 


AGU 


ecu 


UCA 


GAA 


UUA 


AAA 


UAC 


UUA 


GGC 


CUG 


GAU 


ACU 


CAU 


UCU 


CGA 


AAA 


AAG 


AGA 


CAA 


cue 


UAC 


AGU 


GCA 


UUG 


GCU 


AAU 


AAA 


UGU 


UGC 


CAU 


GUU 


GGU 


UGU 


ACC 


AAA 


AGA 


UCU 


CUU 


GCU 


AGA 


UUU 


UGC 


UGA 















A double-stranded DNA fragment encoding the signal peptide, A, B or C peptide chains of human H2- 
preprorelaxin or a combination of any two or more of said chains characterized in that it comprises a 
coding strand and a complementary strand corresponding to the appropriate mRNA sequence or 
combination of the mRNA sequences given below: 
Signal Peptide 
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AUG CCU CGC CUG UUU UUU UUC CAC CUG CUA GGA GUC UGU UUA 
CUA CUG AAC CAA UUU UCC AGA GCA GUC GCG 

A-Chaxn 

CAA CUC UAC AGU GCA UUG GCU AAU AAA UGU UGC CAU GUU GGU 
UGU ACC AAA AGA UCU CUU GCU AGA UUU UGC 



B-Chain 

75 

GAC UCA UGG AUG GAG GAA GUU 
UUA GUU CGC GCG CAG AUU GCC 
AGC AAA AGG UCU CUG 

20 

C-Chain 



AUU AAA UUA UGC GGC CGC GAA 
AUU UGC GGC AUG AGC ACC UGG 



25 



30 




45 



50 



55 



AGC 


CAG 


GAA 


GAU 


GCU 


CCU 


CAG 


ACA 


CCU 


ACA 


CCA 


GUG 


GCA 


GAA 


AUU 


GUG 


CCA 


UCC 


UUC 


AUC 


AAC 


AAA 


GAU 


ACA 


GAA 


ACC 


AUA 


AAU 


AUG 


AUG 


UCA 


GAA 


UUU 


GUU 


GCU 


AAU 


UUG 


CCA 


CAG 


GAG 


CUG 


AAG 


UUA 


ACC 


CUG 


ucu-gag 


AUG 


CAG 


CCA 


GCA 


UUA 


CCA 


CAG 


CUA 


CAA 


CAA 


CAU 


GUA 


ecu 


GUA 


UUA 


AAA 


GAU 


UCC 


AGU 


CUU 


CUC 


UUU 


GAA 


GAA 


UUU 


AAG 


AAA 


CUU 


AUU 


CGC 


AAU 


AGA 


CAA 


AGU 


GAA 


GCC 


GCA 


GAC 


AGC 


AGU 


CCU 


UCA 


GAA 


UUA 


AAA 


UAC 


UUA 


GGC 


UUG 


GAU 


ACU 


CAU 


UCU 


CGA 


AAA 


AAG 


AGA 



















10. 

11. 

12. 
13. 
14. 



A process for the production of a DNA fragment as set out in any one of claims 1 to 7, characterized in 
that it comprises screening a human cDNA clone bank using as a probe a fragment of human H1- 
relaxin DNA. 

A process as claimed in claim 's, characterized in that the said fragment comprises nucleotides of C- 
peptide/A-peptide coding region of the human Hl-relaxtn DNA. 

A DNA transfer vector, characterized in that it contains a cDNA deoxynucleotide sequences cor- 
responding to a gene or DNA fragment as defined in any one of claims 1 to 7. 

A DNA fragment or DNA transfer vector as claimed in any one of claims 1 to 7 and 10, characterized 
in that one or more natural codons or their cDNA equivalents are replaced by another codon which 
codes for the same amino-acid. 

A DNA transfer vector as claimed in claim 10 or 1 1 , characterized in that it is a bacterial plasmid. 
A DNA transfer vector as claimed in claims 10 or 11, characterized in that it is a bacteriophage DNA. 
-A cell transformed by a transfer vector as claimed in any one of claims 10 to 13. 
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15. A process for making a DNA transfer vector for use in maintaining and replicating a deoxynucleotide 
sequence coding for human H2-preprorelaxin or a sub-unit thereof according to any one of claims 1 to 
7, characterized in that it comprises reacting said deoxynucleotide sequence with a DNA molecule 
prepared by cleaving a transfer vector with a restriction enzyme. 

16. A process for making a fusion protein comprising an amino acid sequence consisting of all or part of 
the amino acid sequence of human H2-prorelaxin as its C-terminal sequence and a protein of a 
prokaryotic protein as its N-terminal sequence, characterised in that it comprises incubating a microor- 
ganism transformed by an expression transfer vector comprising the appropriate deoxynucleotide 
sequence. 

17. A process for synthesizing human H2-prorelaxin comprising the A and B peptides separated from each 
other by a C peptide, characterized in that it comprises incubating a microorganism, transformed by an 
expression transfer vector comprising a deoxynucleotide sequence coding for said human prorelaxin 
under conditions suitable for expression of said sequence coding for human prorelaxin, and purifying 
human prorelaxin from the lysate or culture medium of said microorganism. 

18. A fusion protein comprising an amino acid sequence characterized in that it consists of all or part of the 
amino acid sequence of human H2-prorelaxin as its C-terminal sequence and a portion of a prokaryotic 
protein as its N-terminal sequence. 

19. Synthetic human H2-preprorelaxin having the amino acid sequence as set out in Figure 2. 

20. Synthetic human H2-prorelaxin having the amino sequence as set out in figure 2 with the exception that 
the signal sequence is excluded. 

21. A polypeptide having human H2-relaxin activity, said polypeptide having a disulphide bonded A-chain 
and B-chain comprising the following amino acid sequences: 



A-Chain 




Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His VaJL Gly 
Cys Thr Lys Arg Ser Leu Ala Arg Phe Cys 

B-Chain 

Asp Ser Trp Met: Glu Glu Val He Lys Leu Cys Gly Arg Glu 
Leu Val Arg Ala Gin He Ala He Cys Gly Met Ser Thr Trp 
Ser Lys Arg Ser Leu, 



Revendications 

1. Fragment d'ADN codant pour la preprorelaxine H2 humaine, ladite preprorelaxine H2 comportant la 
sequence d'aminoacides representee sur la figure 2. 

2. Fragment d'ADN codant pour la prorelaxine H2 humaine, ladite prorelaxine H2 comportant la sequence 
d'aminoacides representee sur la figure 2, mis a part que la sequence signal est exclue. 

3. Fragment d'ADN codant pour un polypeptide ayant une activite de relaxine H2 humaine, ledit 
polypeptide comportant une chaTne A et une chatne B comprenant les sequences d'aminoacides 
suivantes: 
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Chaine A 

Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His Val Gly Cys 
Thr Lys Arg Ser Leu Ala Arg Phe Cys 
Chaine B 

Asp Ser Trp Met Glu Glu Val lie Lys Leu Cys Gly Arg Glu Leu 
Val Arg Ala Gin He Ala He Cys Gly Met Ser Thr Trp Ser Lys 
Arg Ser Leu. 



Fragment d'ADN codant pour les chatnes de peptides signal, A, B ou C de la relaxine H2 humaine ou 
pour une combinaison quelconque de deux desdites chaTnes ou plus; lesdites chatnes peptidiques 
comportant les sequences d'aminoacides suivantes: 

Peptide signal 

Met Pro Arg Leu Phe Phe Phe His Leu Leu Gly Val Cys Leu Leu 
Leu Asn Gin Phe Ser Arg Ala Val Ala 
Chaine A 

Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His Val Gly Cys 
Thr Lys Arg Ser Leu Ala Arg Phe Cys 
Chaine B 

Asp Ser Trp Met Glu Glu Val He Lys Leu Cys Gly Arg Glu Leu 
Val Arg Ala G^n lie Ala He Cys Gly Met Ser Thr Trp Ser Lys 
Arg Ser Leu v *> 



Chaine C 

Sar Gin Glu Asp Ala Pro Gin Thr Pro Arg Pro Val Ala Glu 
He Val Pro Ser Phe He Asn Lys Asp Thr Glu Thr He Asn 
Met Met Ser Glu Phe Val Ala Asn Leu Pro Gin Glu Leu Lys 
Leu Thr Leu Ser Glu Met Gin Pro Ala Leu Pro Gin Leu Gin 
Gin His Val Pro Val Leu Lys Asp Ser Ser Leu Leu Phe Glu 
Glu Phe Lys Lys Leu He Arg Asn Arg Gin Ser Glu Ala Ala 

Asp Ser Ser Pro Ser Glu Leu Lys Tyr Leu Gly Leu Asp Thr 

His Ser Arg Lys Lys Arg. 

Fragment d'ADN bicatenaire codant pour la preprorelaxine H2 humaine, caracterise en ce qu'il 
comprend un brin codant et un brin complementaire correspondant a la sequence d'ARNm complete 
suivante: 
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AUG CCU CCC CUG UUU UUU UUC CAC CUG CUA GGA GUC UGU UUA 
CUA CUG AAC CAA UUU UCC AGA GCA GUC GCG GAC UCA UGG AUC 
s GAG GAA GUU AUU AAA UUA UGC GGC CGC GAA UUA GUU CGC GCG 

CAG AUU GCC AUU UGC GGC AUG AGC ACC UGG AGC AAA AGC UCU 
CUG AGC CAG GAA GAU GCU CCU CAG ACA CCU AGA CCA GUG GCA 
GAA AUU GU CCA UCC UUC AUC AAC AAA GAU ACA GAA ACC AUA 
AAU AUG AUG UCA GAA UUU GUU GCU AAU UUG CCA CAG GAG CUG 
AAG UUA ACC CUG UCU GAG AUG CAG CCA GCA UUA CCA CAG CUA 
CAA CAA CAU GUA CCU GUA UUA AAA GAU UCC AGU CUU CUC UUU 
' 5 GAA GAA UUU AAG AAA CUU AUU CGC AAU AGA CAA AGU GAA CCC 

GCA GAC AGC AGU CCU UCA GAA UUA AAA UAC UUA GGC UUG GAU 
ACU CAU UCU CGA AAA AAG AGA CAA CUC UAC AGU GCA UUG GCU 
20 AAU AAA UGU UGC CAU GUU GGU UGO ACC AAA AGA UCU CUU GCU 

AGA UUU UGC UGA 

25 6. Fragment d'ADN bicatenaire codant pour la prorelaxine H2 humaine, caracterise en ce qu'il comprend 
un brin codant et un brin complgmentaire correspondant a la sequence d'ARNm suivante: 

GAC UCA UGG AUG GAG GAA GUU AUU AAA UUA UGC GGC CGC GAA 
3 o UUA GUU CGfe GCG CAG AUU GCC AUU UGC GGC AUG AGC ACC UGG 

AGC AAA AGG UCU CUG AGC CAG GAA GAU GCU CCU CAG ACA CCU 
AGA CCA GUG GCA GAA AUU GUG CCA UCC UUC AUC AAC AAA GAU 
ACA GAA ACC AUA AAU AUG AUG UCA GAA UUU GUU GCU AAU UUG 

35 

CCA CAG GAG CUG AAG UUA ACC CUG UCU GAG AUG CAG CCA GCA 
UUA CCA "AG CUA CAA CAA CAU GUA CCU GUA UUA AAA GAU UCC 
AGU CUV CUC UUU GAA GAA UUU AAG AAA CUU AUU CGC AAU AGA 

40 CAA AGU GCC GCA . GAC AGC AGU CCU UCA GAA UUA AAA UAC 

UUA GGC LUG GAU ACU CAU UCU CGA AAA AAG AGA CAA CUC UAC 
AGU GCA UUG GCU AAU AAA UGU UGC CAU GUU GGU UGU ACC AAA 

45 AGA UCU CUU GCU AGA UUU UGC UGA 

7. Fragment d'ADN bicatenaire codant pour le peptide signal, les chaTnes peptidiques A, B ou C de la 
preprorelaxine H2 humaine ou pour une combinaison de deux quelconques desdites chaTnes ou plus, 
caracterise en ce qu'il comprend un brin codant et un brin complemehtaire correspondant a la 
sequence d'ARNm appropriee ou a une combinaison des sequences d'ARNm donnees ci-dessous: 
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Peptide signal 

AUG CCU CGC CUG UUU UUU UUC CAC CUG 
j CUA CUG AAC CAA UUU UCC AGA GCA GUC 
Chaine A 

CAA CUC UAC AGU GCA UUG GCU AAU AAA 
UGU ACC AAA AGA UCU CUU GCU AGA UUU 
Chaine B 

GAC UCA UGG AUG GAG GAA GUU AUU AAA 
UUA GUU CGC GCG CAG AUU GCC AUU UGC 
AGC AAA AGG UCU CUG 



Chaine C 






















AGC 


CAG 


GAA GAU 


GCU 


CCU 


CAG 


AGA 


CCU 


AGA 


CCA 


GUG 


GCA 


GAA 


AUU 


GUG 


CCA UCC 


UUC 


AUC 


AAC 


AAA 


GAU 


ACA 


GAA 


ACC AUA 


AAU 


AUG 


AUG 


UCA GAA 


UUU 


GUU 


GCU 


AAU 


UUG 


CCA 


CAG 


GAG 


CUG 


AAG 


UUA 


ACC 


CUG UCU 


GAG 


AUG 


CAG 


CCA 


GCA 


UUA 


CCA 


CAG 


CUA 


CAA 


CAA 


CAU 


GUA CCU 


GUA 


UUA 


AAA 


GAU 


UCC 


AGU 


CUU 


CUC 


UUU 


GAA 


GAA 


UUU 


AAG AAA 


CUU 


AUU 


CGC 


AAU 


AGA 


CAA 


AGU 


GAA 


GCC 


GCA 


GAC 


AGC 


AGC^CU. 


UCA 


GAA 


UUA 


AAA 


UAC 


UUA 


GGC 


UUG 


GAU 


ACU 


CAU 


UCU 


CGA^AAA 


AAG 


AGA 



















8. Precede pour la production d'un fragment d'ADN selon I'une quelconque des revendications 1 a 7, 
caracterise en ce qu'il comprend le criblage d'une banque de clones d'ADNc humains avec utilisation 
en tant que sonde d'un fragment d'ADN de'relaxine H1 humaine. 

9. Procede selon la revendication 8, caracterise en ce que ledit fragment comprend des nucleotides de la 
region codant pour le peptide C/peptide A de I'ADN de la relaxine H1 humaine. 

10. Vecteur de transfert d'ADN, caracterise en ce qu'il contient une sequence de desoxynucleotides 
d'ADNc correspondant a un gene ou fragment d'ADN tel que defini dans Tune quelconque des 
revendications 1 a 7. 

11- Fragment d'ADN ou vecteur de transfert d'ADN selon I'une quelconque des revendications 1 a 7et 10, 
caracterise en ce qu'un ou piusreurs codons natureis ou leurs equivalents d'ADNc sont remplaces par 
un autre codon qui code pour le meme aminoacide. 

12. Vecteur de transfert d'ADN selon la revendication 10 ou 11, caracterise en ce qu'il est un plasmide 
bacterien. 

13. Vecteur de transfert d'ADN selon la revendication 10 ou 11, caracterise en ce qu'il est un ADN de 
bacteriophage. 

14. Cellule transformee par un vecteur de transfert selon I'une quelconque des revendications 10 a 13. 

15. Procede pour la production d'un vecteur de transfert d'ADN pour utilisation dans le maintien et la 



CUA GGA GUC UGU UUA 
GCG 

UGU UGC CAU GUU GGU 
UGC 

UUA UGC GGC CGC GAA 

GGC AUG AGC ACC UGG 
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replication d'une sequence de desoxynucleotides codant pour la preprorelaxine H2 humaine ou une 
sous-unite de celle-ci selon Tune quelconque des revendications 1 a 7, caracterise en ce qu'il 
comprend la mise en reaction d'une telle sequence de desoxynucleotides avec une molecule d'ADN 
obtenue par coupure d'un vecteur de transfert par une enzyme de restriction. 

5 

16. Procede pour la production d'une proline fusionnee comprenant une sequence d'aminoacides consti- 
tute de I'ensemble ou d'une partie de la sequence d'aminoacides de la prorelaxine H2 humaine en tant 
que sa sequence C-terminale, et une portion de proteine de procaryote en tant que sa sequence N- 
terminale, caracterise en ce qu'il comprend I'incubation d'un micro-organisme transforme par un 

w vecteur de transfert d'expression comprenant la sequence de desoxynucleotides appropriee. 

17. Procede pour la synthese de prorelaxine H2 humaine comprenant les peptides A et B separes Tun de 
I'autre par un peptide C, caracterise en ce qu'il comprend I'incubation d'un micro-organisme transforme 
par un vecteur de transfert d'expression comprenant une sequence de desoxynucleotides codant pour 

is ladite prorelaxine humaine, dans des conditions appropriees a I'expresston de ladite sequence codant 
pour la prorelaxine humaine, et la purification de la prorelaxine humaine a partir du lysat ou du milieu 
de culture dudit micro-organisme. 

18. Proteine fusionnee comportant une sequence d'aminoacides, caracterisee en ce qu'elle consiste en 
20 I'ensemble ou une partie de la sequence d'aminoacides de la prorelaxine H2 humaine en tant que sa 

sequence C-terminale, et une portion d'une proteine de procaryote en tant que sa sequence N- 
terminale. 

19. Preprorelaxine H2 humaine de synthese, comportant la sequence d'aminoacides telle que representee 
25 sur la figure 2. 

20. Prorelaxine H2 humaine de synthese, comportant la sequence d'aminoacides telle que representee sur 
la figure 2, mis a part que !a sequence signal est exclue. 

30 21. Polypeptide ayant une actrvtte de relaxine H2 humaine, ledit polypeptide comportant une chaTne A et 
une chaTne B liees par pont disulfure, comprenant les sequences d'aminoacides suivantes: 

Chaine A 

35 Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His Val Gly Cys 
Thr Lys Arg Ser Leu Ala Arg Phe Cys 
Chaine B 
Asp Ser Trp Met 

40 

Val Arg Ala Gin 
Arg Ser Leu . 

45 

Anspruche 

1. Menschliches H2-Praprorelaxin kodierendes DNA-Fragment, wobei besagtes H2-Praprorelaxin die in 
so Figur 2 dargestellte Aminosauresequenz besitzt. 

2. Menschliches H2-Prorelaxin kodierendes DNA-Fragment, wobei besagtes H2-Prorelaxin die in Figur 2 
dargestellte Aminosauresequenz besitzt, mit der Ausnahme, dafi die Signalsequenz ausgeschlossen ist. 

55 3. DNA-Fragment, das ein Polypeptid kodiert, welches die Aktivitat von menschlichem H2-Relaxin besitzt, 
wobei besagtes Polypeptid eine A-Kette und eine B-Kette besitzt, welche die folgenden Aminosaurese- 
quenzen umfassen: 



Glu Glu Val lie Lys Leu Cys Gly Arg Glu Leu 
lie Ala lie Cys Gly Met Ser Thr Trp Ser Lys 
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A-Kette 

Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His Val Gly 
Cys Thr Lys Arg Ser Leu Ala Arg Phe Cys 



B-Kette 

Asp Ser Trp Met Glu Glu Val 
Leu Val Arg Ala Gin lie Ala 
Ser Lys Arg Ser Leu. 



lie Lys Leu Cys Gly Arg Glu 
lie Cys Gly Met Ser Thr Trp 



DNA-Fragment, das die Signal-, A-, B- oder C-Peptidketten von menschlichem H2-Relaxin oder 
irgendeine Kombination von zwei oder mehr besagter Ketten kodiert; wobei besagte Peptidketten die 
folgenden Aminosauresequenzen besitzen: 

Signal-Peptid 

Met Pro Arg Leu Phe Phe Phe His Leu Leu Gly Val Cys Leu 
Leu Leu Asn Gin Phe Ser Arg Ala Val Ala 



A-Kette 

Gin Leu Tyr Ser Ala leu Ala 
Cys Thr Lys~\£rg Ser Leu Ala 



B-Kette 



Asp 


Ser 


Trp 


Met 


GlU 


Glu 


Val 


Leu 


Val 


Arg 


Ala 


Gin 


He 


Ala 


Ser 


Lys 


Arg 


Ser 


Leu 






C-Kette 












Ser 


Gin 


Glu 


Asp 


Ala 


Pro 


Gin 


He 


Val 


Pro 


Ser 


Phe 


He 


Asn 


Met 


Met 


Ser 


GlU 


Phe 


Val 


Ala 


Leu 


Thr 


Leu 


Ser 


Glu 


Met 


Gin 


Gin 


His 


Val 


Pro 


Val 


Leu 


Lys 


Glu 


Phe 


Lys 


Lys 


Leu 


He 


Arg 


Asp 


Ser 


Ser 


Pro 


Ser 


Glu 


Leu 


Ser 


Arg 


lys 


Lys 


Arg 







Asn Lys Cys Cys His Val Gly 
Arg Phe Cys 



He 


Lys 


Leu 


Cys 


Gly 


Arg 


Glu 


He 


Cys 


Gly 


Met 


Ser 


Thr 


Trp 


Thr 


Pro 


Arg 


Pro 


Val 


Ala 


Glu 


Lys 


Asp 


Thr 


Glu 


Thr 


He 


Asn 


Asn 


Leu 


Pro 


Gin 


Glu 


Leu 


Lys 


Pro 


Ala 


Leu 


Pro 


Gin 


Leu 


Gin 


Asp 


Ser 


Ser 


Leu 


Leu 


Phe 


Glu 


Asn 


Arg 


Gin 


Ser 


Glu 


Ala 


Ala 


Tyr 


Leu 


Gly 


Leu 


Asp 


Thr 


His 



Doppelstrangiges, menschliches H2-PraproreIaxin kodierendes DNA-Fragment dadurch gekennzeich- 
net, dafl es einen kodierenden Strang und einen komplementaren Strang entsprechend der folgenden 
vollstandigen mRNA-Sequenz umfaflt: 
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AUG 


ecu 


CGC 


CUG 


UUU 


UUU 


UUC 


CAC 


CUG 


CUA 


GGA 


GUC 


UGU 


UUA 


CUA 


CUG 


AAC 


CAA 


UUU 


UCC 


AGA 


GCA 


GUC 


GCG 


GAC 


UCA 


UGG 


AUG 


GAG 


GAA 


GUU 


AUU 


AAA 


UUA 


UGC 


GGC 


CGC 


GAA 


UUA 


GUU 


CGC 


GCG 


CAG 


AUU 


GCC 


AUU 


UGC 


GGC 


AUG 


AGC 


ACC 


UGG 


AGC 


AAA 


AGG 


UCU 



CUG 


AGC 


CAG 


GAA 


GAU 


GCU 


ecu 


GAA 


AUU 


GUG 


CCA 


UCC 


UUC 


AUC 


AAU 


AUG 


AUG 


UCA 


GAA 


UUU 


GUU 


AAG 


UUA 


ACC 


CUG 


UCU 


GAG 


AUG 


CAA 


CAA 


CAU 


GUA 


ecu 


GUA 


UUA 


GAA 


GAA 


UUU 


AAG 


AAA 


CUU 


AUU 


GCA 


GAC 


AGC 


AGU 


ecu 


UCA 


GAA 


ACU 


CAU 


UCU 


CGA 


AAA 


AAG 


AGA 


AAU 


AAA 


UGU 


UGC 


CAU 


GUU 


GGU 


AGA 


UUU 


UGC 


UGA 









CAG 


ACA 


ecu 


AGA 


CCA 


GUG 


GCA 


AAC 


AAA 


GAU 


ACA 


GAA 


ACC 


AUA 


GCU 


AAU 


UUG 


CCA 


CAG 


GAG 


CUG 


CAG 


CCA 


GCA 


UUA 


CCA 


CAG 


CUA 


AAA 


GAU 


UCC 


AGU 


CUU 


cue 


UUU 


CGC 


AAU 


AGA 


CAA 


AGU 


GAA 


GCC 


UUA 


AAA 


UAC 


UUA 


GGC 


UUG 


GAU 


CAA 


cue 


UAC 


AGU 


GCA 


UUG 


GCU 


UGU 


ACC 


AAA 


AGA 


UCU 


CUU 


GCU 



Doppelstrangiges, menschliches H2-Prorelaxin kodierendes DNA-Fragment, dadurch gekennzeichnet, 
daj3 es einen kodierenden Strang und einen komplementaren Strang entsprechend der folgenden 
mRNA-Sequenz umfafitfu: 



GAC 


UCA 


UGG 


AUG 


GAG 


GAA 


GUU 


AUU 


AAA 


UUA 


UGC 


GGC 


CGC 


GAA 


UUA 


GUU 


CGC 


GCG 


CAG 


AUU 


GCC 


AUU 


UGC 


GGC 


AUG 


AGC 


ACC 


UGG 


AGC 


AAA 


AGG 


UCU 


CUG 


AGC 


CAG 


GAA 


GAU 


GCU 


ecu 


CAG 


ACA 


ecu 


AGA 


CCA 


GUG 


GCA 


GAA 


AUU 


GUG 


CCA 


UCC 


UUC 


AUC 


AAC 


AAA 


GAU 


ACA 


GAA 


ACC 


AUA 


AAU 


AUG 


AUG 


UCA 


GAA 


UUU 


GUU 


GCU 


AAU 


UUG 


CCA 


CAG 


GAG 


CUG 


AAG 


UUA 


ACC 


CUG 


UCU 


GAG 


AUG 


CAG 


CCA 


GCA 


UUA 


CCA 


CAG 


CUA 


CAA 


CAA 


CAU 


GUA 


ecu 


GUA 


UUA 


AAA 


GAU 


UCC 


AGU 


CUU 


cue 


UUU 


GAA 


GAA 


UUU 


AAG 


AAA 


CUU 


AUU 


CGC 


AAU 


AGA 


CAA 


AGU 


GAA 


GCC 


GCA 


GAC 


AGC 


AGU 


ecu 


UCA 


GAA 


UUA 


AAA 


UAC 


UUA 


GGC 


UUG 


GAU 


ACU 


CAU 


UCU 


CGA 


AAA 


AAG 


AGA 


CAA 


cue 


UAC 


AGU 


GCA 


UUG 


GCU 


AAU 


AAA 


UGU 


UGC 


CAU 


GUU 


GGU 


UGU 


ACC 


AAA 


AGA 


UCU 


CUU 


GCU 


AGA 


UUU 


UGC 


UGA 















Doppelstrangiges DNA-Fragment, das die Signalpeptid-, A-, B- oder C-Peptidketten von menschfichem 
H2-Praprorelaxin oder eine Kombination irgendwelcher zwei oder mehreren besagter Ketten kodiert, 
dadurch gekennzeichnet, da/3 es einen kodierenden Strang und einen komplementaren Strang entspre- 
chend der passenden mRNA-Sequenz oder der Kombination der unten angegebenen mRNA-Sequen- 
zen umfafit: 
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^^ UG uuu uuu uuc cac cue cua gga cue ugu uua 

CUA CUG AAC CAA UUU UCC AGA GCA GUC GCG 

„AC AGU GCA UUG GCU AAU AAA UGU UGC CAU GUU GGU 
UGU ACC AAA AGA UCU CUU GCU AGA UUU UGC 

SfS UGG AUG GAG GAA GUU AUU AAA UUA UGC GGC C=C CAA 
UUA GUU CGC GCG CAG AUU GCC AUU UGC GGC AUG 
AGC AAA AGG UCU CUG 

— Z E £ S Z AAA S £ E S ASA £ 

Tg AUG UCA GAA £ ^ GCU AAU UUG CCA «, £ - - 
UUA ACC CUG UCU GAG AUG CAG CCA GCA -A CCA CAG ^ 
CAA CAU GUA CCU GUA UUA AAA GAU UCC AGU ^ ^ 

GAA UUU AAG A^A CUU AUU CGC AAU AGA CAA A 
GAC AGC AGU CCU UCA GAA UUA AAA UAC UUA GGC 
CAU UCU CGA AAA AAG AGA 

> ^ . besaate Fragment Nukleot.de der C- 

9 . Verfahren nach Anspruch 8. dadurch ^^Z^S^ 
Peptid/A-Peptid-Kodierungsreg.on der menschl.che 

rDNA-Desoxynukleotidsequenz enthalt, die 
a -„h P 1 bis 7 und 10, dadurch gekenn- 

12 . DNA-Transfervektor nach Anspruch 10 oder 11. a 
Piji^fY^id ist 

k a nruch 10 Oder 1 1 dadurch gekennzeichnet, dafl er eine Bakteriophagen- 

13. DNA-Transfervektor nach Anspruch 10 oder 
DNA ist. 
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15. Verfahren zum Herstellen eines DNA-Transfervektors zur Verwendung bei der Erhaltung und Replika- 
tion einer Desoxynukleotidsequenz, die fur menschliches H2-Praprorelaxin Oder eine Untereinheit 
desselben nach einem der Anspruche 1 bis 7 kodiert, dadurch gekennzeichnet, dai3 es das Umsetzen 
besagter Desoxynukleotidsequenz mit einem DNA-Molekul umfam, das durch Schneiden eines Trans- 

s fervektors mit einem Restriktionsenzym hergestellt ist. 

16. Verfahren zum Herstellen eines Fusionsproteins, das eine Aminosauresequenz umfam, die in ihrer 
Gesamtheit oder zum Teil aus der Aminosauresequenz von menschlichen H2-Prorelaxin als ihrer C- 
terminalen Sequenz und einem Teil eines prokaryontischen Proteins als ihrer N-terminalen Sequenz 

w besteht, dadurch gekennzeichnet, da0 es das Inkubieren eines Mikroorganismus umfam, der von einem 

Expressions-Transfervektor transformiert ist, welcher die passende Desoxynukleotidsequenz umfai3t. 

17. Verfahren zum Synthetisieren von menschlichem H2-Prorelaxin, das die A- und B-Peptide, von einem 
C-Peptid voneinander getrennt, umfam, dadurch gekennzeichnet, da/3 es das Inkubieren eines Mikroor- 

15 ganismus, der von einem Expressions-Transfervektor transformiert ist, welcher eine Desoxynukleotidse- 

quenz umfam, die fur besagtes menschliches Prorelaxin kodiert, unter fur die Expression besagter fur 
menschliches Prorelaxin kodierenden Sequenz geeigneten Bedingungen, und das Reinigen menschli- 
chen Prorelaxins aus dem Lysat oder Kulturmedium von besagtem Mikroorganismus umfam. 

20 18. Fusionsprotein mit einer Aminosauresequenz, dadurch gekennzeichnet, dafi sie in ihrer Gesamtheit 
oder zum Teil aus der Aminosauresequenz von menschlichem H2-Prorelaxin als ihrer C-terminalen 
Sequenz mit einem Teil eines prokaryontischen Proteins als ihrer N-terminalen Sequenz besteht. 

19. Synthetisches menschliches H2-Praprorelaxin mit der in Figur 2 dargestellten Aminosauresequenz. 

25 

20. Synthetisches menschliches H2-Prorelaxin mit der in Figur 2 dargestellten Aminosauresequenz, mit der 
Ausnahme, da/3 die Signal-Sequenz ausgeschlossen ist 

21. Polypeptid mit der Aktivitgt von menschlichem H2-Reiaxin, wobei besagtes Polypeptid eine disulfidge- 
30 bundene A-Kette und B-K|tte aufweist, welche die folgenden Aminosauresequenzen umfassen: 



A-Kette 



35 



-Gin Leu Tyr 
Cys Thr Lys 



Arg 



Ser 



Ala Leu Ala Asn Lys Gys Cys His Val Gly 
Ser Leu Ala Arg Phe Cys 



B-Kette 



40 



Asp Ser Trp 
Leu Val Arg 
Ser Lys Arg 



Ala 



Met 



Ser 



Glu Glu Val lie Lys Leu Cys Gly Arg Glu 
Gin lie Ala lie Cys Gly Met Ser Thr Trp 
Leu. 



45 



50 



55 
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8 



CO 

'1 



WW 



-tf/AF — 



CO 



-HINF 



1 



HINF 
\—HINF 



IALUIt 



SSI 



AVAI 



goo 
<8 a 



XT 



c 



8i 



is 
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^Signal Peptide 

fee Pro 

AUG OCU 



chain 



joccacuuu 



AUG ecu ax cue uuc 
Met Pro Arg Uu Phe 



Phe 
UUU 
** 

UUC 
Leu 



UUC CAC CUC CUA 

^ ****** *** 

uuc CAC ac CUA 
fhc His Leu Uu 




-10 

Cysl^L^l£uAsnCanI^SerArKAlaVal/aa 
l£UUiAOJAaJCAACCAAUUUlXX:^^ajCOT 



-10 

l£U 

CUC AAC CAA 

*** *** *** *** *** *** *** *** *** *** *™ 

UOJUUAOJACUCAACC^UJUyXi^C^^OK 
Cys Leu Leu Leu Asn GLa Pie Scr Arg Ala Val Ala 




Met GIu Clu 
AUGGAGGAA 
* * ** ** 

AAC CAC CAU 
Lya Asp Asp 



Val lie Lys Leu Cys CW tot Clu Leu Val 
CUU ALU AAA UUA IOC GX OGC 



20 

Ala Cln He Ala He Cys CLy Met Ser Thr Tftj| Ser Lys 
iCnJCACAUUGXAUUUOCOX/UC^AGCUCC/^ 



CAA UUA GUU 

^ ^ & l£ ^ ^ S ^ ^ Val Arg A^ ^ 



Arg Ser 
AGC UCU 

AGC UCU 
Arg Ser 



Leu 
CUC 
*** 

ac 

Leu 



C-pe ptide 

*** *** ^ !^ 7Z- «-a r^L Mil nil CCA 1EC UUC ALE AAC 



rc^^^COJCCUCACACACaJ/^CCACUSGCAGAA/^ClWCC^ 
Ser Qji Su Mp Ala Pro Gin Thr Pro Arg Pro Val Ala Qu He Val Pro Ser ftie lie Asn 



lys Asp Thr 
AAA CAU ACA 

*** *** *** 

AAA CALJ PCn. 
Lys Asp Thr 



Chi Thr lie 
.CAA ACC AHA 



Asn Met 1 
AAU AUG 
* * ** 

CAA ACU AUaIaUU AUC 
GLu thr He He lie 



Met 
AUG 
*** 

ALC 
Met 



[Serl 
UCA 
* 

uc 

Leu 



CAA UUU 
*** ** 

CAA UUC 
GLu Ihe 



GUU 
** 

AUU 
He 



70 



GLu FhelVallALa Asa leu Pro [On 



OCU AAU UC OCA 
*** *** *** *** 

OCU AAU UC CCA 
Ala Asn leu Pro 



ac 
* * 

ac 

Pro 



GIu Leu Lys 
G\C OC Ar£ 

*** *** *** 

ac aac 

GLu Leu Lys 



Leu Thr 
UUA ACC 
* ** 

oca or 

Ala Ala 



Leu Scr GLu 

CUA UCU GAG 
Leu Ser GLu 



A 






Met 


Girt Pro 


Ala 


A1C 


OC CCA 


OCA 


* * 




JT* 


/CC 


CAA CCA 


UCA 




Gin Pro 


Ser 



Leu Pro 
UUA GG\ 



Gin 



UUA CCA G£ 



Leu Pro 



GIu 



Leu GLn Gin [His 
CUA CAA CAA CAU 



CAU 



CCA CAC CAC.- _ 

Leu Cln Qn[Tyr|Val Pro 



Val Pro 
CUA CCU 
*** *** 

GUA OCU 



VaTlLeu Lya Asp Ser 
CUA UUV AAA gau UOC 
* * *** ** *** *** 

OCA UUA AAC CAU UOC 
AlaJLeu Lys Asp Ser 



Ser 
/GU 
* 

AAU 
Asn 



Leu 
CUU 

CUU 



Lev? 

CUC 
* 

PCC 



Leu Ser 



110 

ftie Clu Clu Phe Lys Lys Leu lie Arc Asn 
UUU CAk GAA UJJ AAC AAA OXJ AUU CDC AAU 
*** *** *** *** *** *** *** *** *** *** 
UUUGAAGAAUUUAACAAACUUAijUaXAAU 
Phe GIu Clu ftie Lys Lys Leu He Arg Asn 

A- chain 



Are Cln Ser GIu Ala Ala Asp Ser 

** *** *** *** *** *** *** *** 

/OCCAAiCUGAACOCOCAGACAOC 
^rg Gin Ser GIu Ala Ala Asp Ser 



Ser 

* * 

AAU 
Asn 



120 



130 



Pro Ser Clu Leu Lys TYr Leu GLy Leu Asp Thr His Ser 
SlCA^AUUAA^u^UUAatUCG^^^Uaj 
*** *** *** *** *** *** *** *** *** *** *** *** *** 
CraJUCACAAumAAAUACUIAOXUCGAUAOJCAUra 
Pro Ser GIu Leu Lys iyr Leu GLy Leu Asp Thr His Ser 



* 

CAA 
Gin 



Lys Lys Atr 
AAA ArG AGA 

^VC^fc ^C^^k ^nirA 

AAA AAG AGA 
Lys Lys Arg 



Gin 
G\A 
* * 

CA 



Teu 
CUC 
* * 

C0C 
Pro 



Tyr^- 
UAC /CU 



UAC 



ac 



Val 



Ala leu 
OCA UC 

OCA OC 
Ala Leu 



Ala Asn 
OCU AAU 
* * 

UUU GAG 
ftie Clu 



Lys Cys Cys 
AAA UCU IDC 



His Val 
CAU CUU 
* 

AAA ICU UGClCUA AID 
Lys C!ys Cys lLeu He 



150 

Cly Cys Thr Lys Arg Ser leu Ala 
GCUUCUACCAAA/KUCUCUUGCU 

**^ ********* ** ********* 

OCUUCU/CCAAAAOCUCUCUUGCU 
Cly Cys Thr lys Arg Ser Leu Ala 



Am Ftte 
/GAUJU 
* * * * 

AAA UAU 
Lys TYr 



160 
Cvs *** 

UOC USA 

UOC UCA 

cys *** 



FIGURE 2. 
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